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Abstract 

In this paper we present a study of the mixing time of a random walk on the largest 
component of a supercritical random graph, also known as the giant component. We identify 
local obstructions that slow down the random walk, when the average degree d is at most 
Vln nln In n, proving that the mixing time in this case is 0((ln n/d) 2 ) asymptotically 
almost surely. As the average degree grows these become negligible and it is the diameter 
of the largest component that takes over, yielding mixing time 0(ln n/ln d). We proved 
these results during the 2003-04 academic year. Similar results but for constant d were later 
proved independently by I. Bcnjamini, G. Kozma and N. Wormald in [3]. 

1 Introduction 

Given a graph G with vertex set V n = {1, 2, .., n}, the simple random walk on G is the Markov 
chain where for every edge ij of G, the transition probability pij from i to j is ^y. I.e. we 
exit a vertex via a uniformly chosen edge. Formally, we have defined the entries of an n by n 
tranistion matrix P for the chain (where Pij = if ij is not an edge) and the distribution of the 
last point in at step walk from initial distribution xq is xqP 1 . 

This chain (is ergodic and therefore) has a limit distribution precisely if G is connected 
and non-bipartite. In this case, the limit distribution ir satisfies 7Tj = 2 \e(g)\ wnere ^00 > the 
degree of i, is the number of edges of G incident to i (we extend this notation to sets letting 
d(S) be the sum of the degrees of the vertices in S). Furthermore, the chain is reversible, as 

KiPij = ^jPj,i = 2\E{G)\ ' 

We are interested in the mixing time of this chain for various random graphs on V n . In this 
setting, we say that an event occurs asymptotically almost surely (a.a.s.) if its probability tends 
to 1 as n — > oo. 

We consider the mixing time convergence with respect to the total variation distance dxv 
between two probability distributions on V n defined as: 



drv (p {1 \p {2) ) = max pW(A) -p i2) (A) 
\ J ACV n 
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Thus the mixing time of the chain is 



TmixiG) = supmin{i : drvipoP 1 , ir) < 1/e}. 



X' 



It is easy to prove that min{t : dry (zqP* , < (2/e) 1 } < lT m i x . So, T m i x measures not only 
how long it takes to get to within 1/e of it, but also bounds how long it takes to get arbitrarily 
close to 7r. Thus, it measures the rate at which the Markov chain mixes. 

If we choose a graph G uniformly at random from all graphs on V n then a.a.s. 



Vie7„ 

and 



Tl 

d(i) < \7nln n 



2 



V i +3 



\N{i)nN{3)\-\ 



< \pn\vL n, 



where N(i) is the neighbourhood of vertex i in G. It follows easily that a.a.s. for every j 
^U) = n °(n) an d) by counting the number of paths of length 2 from i using the inequalities 
above, that a.a.s. 

Vt^j, P?(j) = -+o n 



n \n 

So a.a.s. T m i x (G) = 2, which is also the diameter of G. 

We shall consider the random graph G n ^ p on V n where each edge is present independently 
with probability p, and hence the expected degree of a vertex is p(n — 1). In what follows we 
always couple the use of p and d by insisting that d = pn; this is essentially the expected degree 
and makes our formulas a little easier than if we used d to represent the actual expected degree. 

In the same vein, we can prove: 

Theorem 1.1 For every p = p{n) with d — In n = uj(1) we have that a.a.s. 

In n 



In d 



< 3. 



This result improves on earlier work of Hildebrand [12] who had determined the mixing time up 
to a multiplicative factor for p = &7( ln n ). Its easy proof, given in a companion paper [8], relies 
on proving inductively that the number of nodes at the j-th level of the breadth-first search 
tree from an arbitrary vertex i is concentrated around d{i)d^~ l provided this is o(n) (and hence 
generalizing the easy argument above). [8] also uses similar arguments to obtain results on the 
diameter of G n ^ p for d = u;(l), strengthening results in [6] and [TT] . 

The situation for p < is more problematic, as the local structure of the graph begins to 
play a role. For one thing, if In n — d = uj(1) then G ntP almost surely has vertices of degree zero 
and hence is not connected (see for example Theorem 7.3 in [5]). However, if p > for some 
e > then a.a.s. the largest component H n p has order Q(n) vertices whereas the second largest 
component has 0(ln n) vertices. So, for small p, we consider the simple random walk on the 
giant component H n p of G n ^ p . 

A second type of local structure which comes into play at this point are the vertices of H np 
which are far away from any vertex of degree exceeding two. An easy second moment argument, 
given at the end of the paper, shows that: 

Lemma 1.2 For p < a.a.s. H niP contains paths of length more than all of whose 

interior vertices have degree two. 
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Given such a path Q with 21 + 1 vertices, we can label these vertices with consecutive integers 
so that the midpoint x = xq is labelled 0. Now, if we start our random walk at x, then we can 
mimic the initial part of the walk, until we touch an endpoint of Q, by the standard random 
walk on the integers (where we go from i to i — lori + 1 with equal probability) starting at 
0. It is well known (see for example [10] page 349) that we expect to take I 2 steps before we 
have seen an integer with absolute value I in this walk. Thus if x also denotes the distribution 
on V n that gives unit mass to x, it follows easily that xP t (Q) > | for t < A (^r) and so 

rp ^ 1 / In n \ ^ 

J- mix ^ io ^ 8d J ; 

For p < nlnl g _ n this bound is larger than the diameter of H ntP and is tight up to a 
multiplicative factor. For p which exceed V ln rcinin _n Du t, are 0(^-^), it is the diameter which 
correctly approximates T m i x as these paths are too small to exert much influence. So, we can 
determine the first order term in the mixing time precisely. 

In doing so, we actually consider a slightly different definition of the mixing time: for t > 
we let Tf be uniformly distributed on {0, ...,£— 1} and we set 

T m ix(G) = supmin{£ : dTv{xoP T \ tt) < l/ e l- 

x 

This is within a constant factor of many other mixing times (its definition is inspired by 
the properties of the uniform averaging rule - see e.g. [18] Theorem 4.22 or [19j Theorem 5.4 
for further details). It is at most a constant factor larger than T m i x . For the modified chain in 
which we stay in the current state with probability 1/2 and take a step with probability 1/2 in 
each iteration, T m i x is at most a constant factor larger than this mixing time. We have shown 
that actually for the chains we are considering, a.a.s. T' mix is within a constant factor of T m i X 
even without this modification. This result is a consequence of a more general result which will 
appear in a separate paper [9]. In particular, it answers Problem 17 in Section 4.3.3 in pp. 

We show: 



Theorem 1.3 For every p = p{n) with Vln " r | nln " < p < we have that a.a.s. 

t' (r ) lnn 




In d 

Theorem 1.4 For every e > 0, and p = p(n) with < p < — l — - we have that a.a.s. 

T m ix(G n ,p) = O 




The proof of the first of these theorems is similar to but more complicated than that of Theorem 
11.11 and it will be given in a companion paper [8]. In this paper, we handle the more delicate 
situation of small p, proving the second theorem. A similar result concerning the G n ^ m model with 
constant average degree was proved independently by I. Benjamini, G. Kozma and N. Wormald 
in [3]. We remark that as lower bounds on Ffnix are also lower bounds on T^^ x (up to a constant 
factor); our results are tight. 

We close this introductory section presenting some tools which we use later in our proofs. 

For any set S of states of a Markov chain we define Q(S) to be the probability we leave S 
when we are in the steady state; so Q(S) = YlieS jgS^ityPhj an d ^(s) ^ s ^ ne P r °bability that 
we leave S given we are in it. Thus, is the expected length of a sojourn in S when we are 
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in the steady state. It follows easily , see e.g. |7j, that the mixing time of any Markov chain is 
at least maX| 5 . o<tt(s)<|} ioq(s) • 

As in [13J , we define the conductance of S, denoted 3>(S) to be ^(sfyy \s) anc ^ ^ e con " 
ductance $ = &(G) of G to be max^. o<tt(s)<i} ^(S*)- We note that for reversible chains, 
<S>(S) = $(V n \S), so 

T > J- 
lmra - io$' 

Moreover, M. Jerrum and A. Sinclair proved in |13j that the mixing time of an irreducible, 
aperiodic and reversible Markov chain satisfies: 

C 

T' mix < ^ log7T-J n , (1) 

for some constant C. Thus, the mixing time of a reversible Markov chain is approximately 
determined by its conductance. 

In [7J, the authors of this paper, treading a path blazed by L. Lovasz and R. Kannan in |17j . 
proved a strengthening of this result which can be used to tie down the mixing time of many 
Markov chains more precisely. 

For p > ir m i n we let 3>(p) be the minimum conductance of a connected set S with § < n(S) < 
p (if there is no such a set we define = 1). In [7J, we prove: 

Theorem 1.5 For an irreducible, reversible and aperiodic Markov chain on V n we have 

r mix <c 

3=1 

for some constant C that does not depend on the chain. 

Remark: The above sum can be approximated within a constant factor by the integral 
f 1 ^ 2 /o .ibtt \ ■ So if we bound <Hx>) by we obtain (TTT) . However, the bound of Theorem 11.51 is 
often tighter. Such is the case for the chain considered in this paper. 

We apply Theorem [T3] to deal with the mixing time for small p. To do so, we need to bound 
from above the conductance of connected sets of various sizes. 

This completes our preliminary remarks. In the next section, we discuss the precise results 
on the conductance of H UtP that we need to prove Theorem 11.41 and show that they do indeed 
imply this result. In Sections HU we prove these results. In Section EJ we prove Lemma 11.21 
which shows that the bounds of Theorem ll.4l are tight. We close this section with the statement 
of Talagrand's inequality (see for example inequality (2.43) p. 42 in [IS]), which we will use at 
various points in our proofs to derive concentration bounds: there exists a constant 7 > such 
that for any t > 0, if X is a binomially distributed random variable: 

P[|X - ELY] \>t}< 4 e -7i 2 /(E[x]+t) _ ( 2 ) 
2 The Evolution of the Conductance 

We focus now on the conductance of the connected subsets of H niP . To this end, we let e* be the 
number of edges of H UiP and recall that for non-biparite H njP , for every vertex v, tt(v) = $jgr. 
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Letting e(5) = \{{i,j} G £ : i,j G S}| and e out (5) = \{{i,j} : i G S,j 5}|, we have 
that if H n>p is non-bipartite: n(S) = = - — and Q(5) = 6 2e l' 9 ' ) . Thus for such 

non-bipartite H n #, $(S) = (2 e {s)+e°^{s))*{V n \s) wnich is within a factor of 2 of 2e (g) +e °ut(g) and 
so to bound conductance we need to know about the behaviour of these two variables. 

The advantages of focussing on these variables individually, rather than on conductance itself 
are two-fold. The first is that they are defined for all H np not just for non-bipartite ones. The 
second is that we can quickly see that it is e out (S) which gives us difficulty. Indeed, standard 
concentration results easily yield: 

Lemma 2.1 There exists an absolute constant I such that for any p with i < p < 21 " n , a.a.s. 
every connected set S in G np satisfies e(S) < ld\S\. Furthermore, if \S\ = o((ln n) 2 ) then 
e{s)<2\S\. 

The easy proof of Lemma l2.1l is given in the next section. The behaviour of e out {S) is more 
problematic. However, more careful counting arguments allow us to show: 

Lemma 2.2 There exists constants e > 0, c > and do > 1 such that for every p = p(n) 
satisfying < p < 2X \ n , a.a.s. every connected subset S of H n ,p with \S\ > ^p, and 
d(S) < satisfies e out (5) > ed\S\. 

Lemma 2.3 For any two constants c\ and c 2 with 1 < c\ < c 2 there exist e, A > such that 
a.a.s. for d = d(n) lying between c\ and C2, every connected subset S of H UtP with \S\ > Alnn 
and d{S) < satisfies e ont {S) >e\S\. 

Since H ntP is connected we know that e out {S) is at least one for all strict subsets of V(H ntP ). 
Armed with this fact and Lemmas 12.112.21 12.31 we can now give the: 

Proof of Theorem 11.41 It is well known that H n p is a.a.s non-bipartite. So we assume this 
to be the case and apply Theorem 11.51 to obtain: 

riogTr" 1 I 

1 ° mtn I 

T' mix {H ntP )<C Y, *~ 2 (2^')- 
j=i 

We now bound the sum Y&S*"^ $" 2 (2~- ? ). 

Applying Lemma [2TT1 with S = V, we have that e* = 0{dn). Since H n>p is connected, iT m i n 
is at least = ^(i)- So log vr~^ n = 0(ln n). We claim that we can apply Lemmas I2.1|2.21 
and 12.31 to show that there exists absolute constants r and c such that if $ _1 (S') > r then 
w(S) < c ^° n " . So letting / be the set of j such that 2 _J lies between 7r m j„ and c ^ n , our sum is 
bounded by 0(ln n) + Y, je i $~ 2 (f j )- 

But, $(S) > e **/g\ , for such S because H HjP is connected. So, since e* = 0(dn), < £ -2 (2~ J ) = 
0(d 2 n 2 2~ 2 ^) uniformly for j G /. Since this is a geometric sum, it is of the same order as its 
largest term, which is O ^(^p) 2 ^ as required. 

It remains to prove our claim. 

Recalling how we have expressed $ in terms of e out and e, and combining Lemmas 12.11 12.21 
and 12.31 for c\ = ^-^,C2 = do we have: 

For any e > 0, there is a c and an r such that for any p = p(n) which is at most and at 
least iis, a.a.s any connected set S with at least vertices and tt(S) < | satisfies < r 

(note that when d is less than do, it can be treated as a constant). 
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But since $(S) > g if e out (S) > e(S), the second half of Lemma 12.11 tells us that a.a.s 
every such small set whose conductance is small satisfies d(S) < 4\S\ and so tt(S) = O 
Combining this with our bound on the size of S proves the claim and the lemma. 

Lemma 12.21 is easy to prove using Talagrand's inequality. Lemma 12.31 is much more difficult. 
To prove it we consider the core of G UtP which is the maximal subgraph of G n ^ p all of whose 
vertices have degree at least 2. We prove some results about the expansion properties of the core 
and then translate them into results on H n ^ p which imply Lemma 12.31 In doing so, we condition 
on the degree sequence of the core and we use the Bender-Canfield model for graphs with a given 
degree sequence. We discuss this model in Section 14.11 but readers may consult [2] for further 
details. 

In the next section we present some well-known properties of G n ^ p and prove Lemmas 12.11 
Lemma 12.21 and the following, of which we shall have need. 

Lemma 2.4 For every fixed d > 1 there exists L > such that if p = ^ then a.a.s. every 
connected set S C V n in G n ,p with n > \S\ > Inn has d(S) < L\S\. 



The proof of Lemma 12.21 is also presented in Section [3l 

Now Lemma 11.21 guarantees that a.a.s. G n , p contains paths which are connected sets with 
conductance between , , and , , n 2 „ , . Since — ^— us non-decreasing in x, Lemma [XT] below tells 

us that every set S of order k < A p ^^ has conductance at least 2E+1 = ^ (bTn)- On * ne °ther 
hand Lemmas 12.21 and 12.31 tell us that the minimum conductance a set of order at least A p lnn/d 
can have is a.a.s. 17(1). So, if $(H ntP ) denotes the minimum conductance of a subset of H n , p we 
have: 

Theorem 2.5 Whenever d = d[n) > 1 + 0(1), a.a.s, 

•(fl^> = e(.rio{JL,i}). 

3 Some Simple Facts about G n)P 

To prove Lemma 12.11 we simply combine the following two lemmas. 

Lemma 3.1 For every p = p{n) with i < p < 21 " n , a.a.s. every connected set S <^V n of G n , p 
with \S\ < satisfies \S\ - 1 < e(5) < 2|5|. 

Proof. The expected number of sets of 2k edges spanning a set of k vertices is {^2k)P 2k — 

{f) k ^5F^ = (^TT-) k - For k < esk this is less than 2 ~ k - For k less than fH> [t is less 
than s/n k . The result follows by the first moment method. I 

Lemma 3.2 There exists a constant I such that for any d = d(n) between 1 and 2 In n, and 
p = a.a.s. every set S with \S\ > satisfies e(S) < ld\S\. 

Proof. If efc denotes the number of edges on a set of k vertices, then E[e&] = p( 2 ) < As 
is binomially distributed, we can use Talagrand's inequality ([2]) to show that 

P[e fc > E[e fe ] +t] < 4exp 



It 2 



E[e k ]+t 
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for some universal constant 7 > 0. In particular, for a natural number I which will be determined 
soon 

¥[e k > Idk] < F[e k > E[e k ] + (I - l/2)dk) < 4exp ~ ^jf ^ ) < 4ex P (-^) > 

for I sufficiently large. Therefore the expected number of sets with k vertices and at least Idk 
edges is at most 4Qe 2 < 4 ( 2 J < 4 ( 60ecTe 2 J and 

4 ^ (60ed 2 e- 2 ^y = o(l), 

for I a sufficiently large constant. I 

We will need the following simple fact in the proof of Lemma 12.21 It will also be useful later. 
It is an immediate consequence of, for example, Talagrand's inequality. 

Lemma 3.3 For p > a.a.s G n>p has (\ + o(l))pn 2 edges. 

Proof of Lemma 12.21 We specify our choice of e < g below. So we can assume e out (S) = 
e out^y _ j g a £ mos ^ dn ag otherwise we are done. Since d(S) < d ( H £' p ) ; by Lemma [3.31 



d(V -S)>^ and so e{V - S) > @. So, we are done by LemmaOfor S with \S\ > n - ^. 
Key to our analysis is the fact that there are k k ~ 2 labelled trees on k vertices. Thus the 



expected number of trees of Q n ^ p with k vertices is 



n » - k-2 ( a \ ^ „/ i\k 



h k - 1 - < n{edf. (3) 
k J \n J 

Having fixed both a set S of k < n — ^ vertices and the set of edges within S, the expected 
value of e ont (S) is dk(l — k/n) which is at least dkjYll. Since e out (S) is binomially distributed, 
Talagrand's inequality yields 



P[e out (5) < E[e out (5)] -t}< 4exp 
Setting t = dk/24l, we obtain: 



It 2 



E[e out (S)] +t 



*-(S> < § 



( dk \ 



Combining ([3]) and (jU, we obtain that for k < n(l — 1/12/), the expected number of connected 
sets S of G nd j n with \S\ = k and e ont (S) < dk/2Al is at most 

4n(ed) k e-^ im . 

If we choose do such that ede~ 7c( / 100/ < e ~ 7rf / 20W for every d > do, and set c = then 
4n ^ {e l ^ d / im d) k < 4n ^ (e"^ 200 ^ = 4n ^ O = o(l). 



— a — a 
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Proof of Lemma 12.41 Since d(S) = 2e(S) + e out (S) and we have an upper bound on e(S) 
by Lemmas 13.11 and 13.21 it suffices to bound e out (5). For a set S with k vertices E[e out (S')] = 
pk(n — k) = dk (l — ^) < dk. By Talagrand's inequality © 

F[e ont (S) > Idk] < ¥[e out (S) > E[e ont (S)} + (Z - l)dk] < 4exp 

<- «-p(-2?). 

Hence the expected number of connected sets 5 with k vertices and e out (S) which is at least Idk 
is at most 

4n(ecfe -2 2") fc . 
Choosing I > — so that ede~^ ld < e~ 3 we obtain: 

n 

An (ede- 1 ^) k < 4n 2 e" 31nn = o(l), 

fc= [In n] 

and this concludes the proof of the lemma. I 

4 The Core of the Proof 

In this section we obtain a lower bound on e out (S) for any sufficiently large connected subset of 
H n! p, for any d bounded between two constants. To do so, we mainly investigate the expanding 
properties of the core of the biggest component of a random graph, which we will denote 
by C{H n , p ). This is the maximal subgraph of H n)P having minimum degree at least 2. We let 
N = \V(C{H n)P ))\ and M = \E(C(H njP ))\ and recall (see [21]) that a.a.s. both M and N are 
0(n), for the d within the range we are interested. The following lemma bounds below e out (S) 
for any connected S sufficiently large which is a subset of the vertex-set of the core. 

Lemma 4.1 For every d = d{n) with 1 < c\ < d < C2 for two constants c\ and 02, and 
A G (0,1) there exist constants «o = cto(d, A) > and C\ = Ci(d,X) such that a.a.s. every 
connected S C V (C(H n , p )) with Ci Inn < \S\ < XN is joined to C(H rhp ) — S by at least o.q\S\ 
edges. 

The proof of this lemma is postponed until the next subsection. It is reasonably easy because 
the fact that the core has minimum degree two gives it strong expansion properties. 

We are interested in the expanding properties, of connected subsets of H n>p rather than its 
core. That is we would like to bound below e ont (S) in the case where S is an arbitrary sufficiently 
large connected subset of V(H n>p ), using Lemma UTTl To this end, we note that the components 
of H UtP — C(H n-p ) are trees. Each such tree has a unique vertex that is adjacent to a vertex of 
the core; we say that the tree is rooted at that specific vertex. We call these trees decorations 
of the core. To apply the above lemma we need to prove that any sufficiently large connected 
subset of vertices of H n ^ p has at least a certain proportion of its vertices belonging to the core. 

We show that this is indeed the case in the following lemma, which bounds the probability 
that a sufficiently big connected set of vertices of the core has at least / times more vertices in 
the decorations dangling from its vertices. 



Idk 



S 



Lemma 4.2 For every fixed d = d(n) with 1 < c\ < d < C2 for some constants c\ and ci, there 
are constants Z > 1 such that a.a.s. every connected subset S of H n ^ p with xlnn < \S\ < N is 
such that the number of vertices belonging to C(H ntP ) is at least \S\/l. 

Proof. Clearly, if we choose a counterexample S so as to minimize the number of decorations 
which it intersects but does not contain, then it partially contains at most one decoration. I.e., 
it suffices to prove that for some x > 0) f° r an Y sufficiently large integer t, the expected number 
of trees of G U)V with tk > x^ nn vertices, such that (t — l)k of these vertices are incident to 
no edges off the tree and induce a forest tends to as n — ► oo. We can see that the expected 
number of such trees is bounded above by 

CO ( tk } tk ~ 2ptk ~ 1 ( 1 ~ p) [n - tk){tk ~ k)+ ( tk i k y {tk ~ k) . (5) 

If tk < en, then using ([3|) we can see that for any given t and k this is bounded above by 

n/e'^^l-p)^ 1 - 1 /^ 1 ^. 

This is at most nzf^e 02 ^ 6 ^* for z d = e 1+ln d-^C 1 - 6 ). We note that if e is chosen to be 
sufficiently small, then z^ < z Cl < 1. By making t large enough we can make f*^ e c 2(i-<0/* 
smaller than (1/ z Cl ) tk l 2 . Thus, we see that the expected number of such trees having at most 
en vertices is at most 

nz^ n l 2 . 

Setting x = 4/ m (l/ z ci) clearly suffices. 

If tk > en, we need to use more accurate bounds. Since S lies in H n ^ p its number of vertices 
is a.a.s. at most An for some A = \{d) < 1. Then writing tk = an, ([5]) is bounded above by 

0( n ) ( 1 V ( tk \an) an (- Y" e -«fa(i-a)(i-i/t)-n^/2+«3d a Va > 

uniformly for any a G [e, A] . In turn, this is bounded above by 

0(n)( a M e ^M)+3 C2 /2 t ) 
v \an/t) 

where h{a, d) = — (1 — a) ln(l — a) + a In d — cfce(l — a) — da 2 /2. Elementary calculations show 
that for each d > 1 there exists > such that h(a,d) < —Cd, for every a S [e,A]. Taking f 
sufficiently large so that ( Q "'/ t )e n3c2//2< < e nCd ^ 2 , it turns out that © is no more than 



0(n)e 



-nc d /2 



whenever en < tk < An. The result follows. I 

In order to prove Lemma 12.31 we will need another result of a similar flavour. 
A dangling tree is a rooted tree T of G n , v all of whose non-root nodes are incident only to 
the edges of T. 

Lemma 4.3 For every fixed d = d(n) with 1 < c\ < d < C2 for some constants c\ and C2 there 
is a constant I such that setting p = ^, a.a.s. for every i > I, the number of nodes of G„ iP in 
dangling trees of size i or greater is less than ^ . 



9 



Proof. Using (J3j) we can see that for any given j = 0(ln n), the expected number of dangling 
trees of size j is bounded above by: 

nd j e j (l -p^-Vin-i) = 0(nd J e j e - *'). 

As in the proof of the last lemma, this is falling exponentially quickly as j increases. It follows 
that a.a.s every dangling tree has size 0(ln n). Furthermore, a straightforward second moment 
calculation shows that assymptotically almost surely for every j, the number of dangling trees of 
size j is at most twice its expected value. Provided this condition holds, so does the conclusion 
of the lemma. I 

With these three results in hand, we can give the 

Proof of Lemma 12.31 To obtain a lower bound on e out (S) where S is a sufficiently large subset 
of V(H ntP ), we want to use Lemma I3~2l to argue that at least \S\/l of its vertices belong to the 
core and then apply Lemma R~T1 to deduce that S has at least cto\S\/l edges to S. Of course such 
an approach is valid whenever S contains a proportion of vertices of the core that is bounded 
away from 1. As we are interested in bounding the conductance of sets having tt(S) < 1/2, it 
might be the case that S contains almost all of the core or even all of it. In particular, this 
might be the case when d is close to 1 (see |21j for precise bounds on the proportion of vertices 
of Hn tP belonging to the core). Hence, in this case we must argue in a different way as we shall 
see below. 

First we need the following: 

Claim 4.4 Suppose that p is as in Lemma \4-S\ Let T denote a collection of vertex- disjoint trees 
which are induced in H np and let t be the total number of vertices they contain. There exist 
constants x-,1 > 0; such that every such T with all its vertices, except one per tree, incident only 
to edges within T and t > %ln n has a.a.s. \T\ > t/l. 

Proof. The proof of this claim is essentially the same as that of Lemma 14.21 except that in (|5|) 
we replace the factor (tk) tk ~ 2 by k{tk) tk ~ k ~ l , which is the number of forests we can build on tk 
vertices, with k particular vertices belonging to different trees. We omit the details. I 

By the above remarks, to complete the proof of Lemma [2.3l we need only prove for S satisfying 
\S n V{C{Hn )P ))\ > (1 — tq)N ', for a constant to = To(d) to be specified later. 

We note that since S is connected, for any vertex w in V(C{H njP ) — S) the union of the deco- 
rations attached at w is disjoint from S. Let ec{S) denote the number of edges in V(C(H niP ) — S) 
along with the attached decorations. 

Furthermore, any component of H n . p — S disjoint from the core is a dangling tree rooted at 
a vertex which has a neighbour in S. Let t$ be the total number of vertices involved in these 
trees. If T$ is the number of these trees, then e out (S) > T$. We show that Tg > e*/4Z, where I 
is the constant that appears in the above claim. 

Indeed, since Tr(V(H n , p ) — S) > 1/2, we have 

e* < d(V(H n>p ) -S)< 2t s + e out (S) + 2e c (S). (6) 

We may assume that e (S) < e*/4, as otherwise we are done. Since V(C{H ntP ) — S) contains 
at most To-ZV vertices, Lemma 1431 implies that 2ec(S) < e*/4, provided that we choose to small 
enough. Plugging these bounds into ©, we deduce that ts > e*/4. Claim [4^41 yields the bound 
on Ts and the proof of the lemma is now complete. I 
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4.1 The expanding properties of C(H njP ) 

In this section we investigate the expanding properties of the core of the giant component of a 
G n ,d/n random graph, where d is bounded between two constants. We are aiming towards the 
proof of Lemma 14. 1[ 

We actually consider the core of G U)V rather than the core of H n ^ p because it is known that 
C(G n ,p) conditioned on its degree sequence is uniformly distributed over all graphs having this 
degree sequence. This follows from Proposition 2.1(b) in [22\ that describes the distribution of 
the graph that remains from the stripping-off process conditioned on its degree sequence. It is 
also known (again see [22]) that a.a.s. the difference between these two graphs consists of a set 
of 0(ln n) cycles each with at most 0(ln n) vertices. So it is enough to prove the variant of 
Lemma ED obtained by replacing the core of H n ,p by the core of G n , p . 

In doing so, we will use the configuration model of Bender and Canfield (see [2]). Suppose 
we want to analyze a uniformly random graph Gn on Vjy = {1> 2, N} with a given degree 
sequence {di, d/v}. For each i S Vn we take di copies of i, thus forming a set Fjy of 2M 
points. A perfect matching on Pjv corresponds to a multigraph on Vjv where an edge between 
copies of i and j yields an edge from i to j. Note that this may create loops or multiple edges. 
We consider the random multigraph G' N that comes out of a uniformly random perfect matching 
on Pjv- As shown by McKay and Wormald in |20j . under certain technical conditions, results 
proven in this model can be transferred back to a uniformly chosen simple graph on the given 
degree sequence. 

The key to our lemma is the following, which is very easy to prove by considering this 
correspondence. 

Proposition 4.5 For each integer N > 1 let (d±, . . . , djy) be a degree sequence on the set Vjv = 
{1, . . . , N} and let Gn be a random graph having this degree sequence. Assume that 

1. For each N and for i = 1, . . . , N we have di > 2. 

2. 2M = di < J2?=i dj < CN, for some C > 0. 

3. max{(ij}i<j<Ar < lniV ; for N sufficiently large. 

Then uniformly for every set of vertices S with d(S) = Ylies di being an even number, we have 



[There are no edges between S,Vn \ S in Gn] = O 



\d(S)/2j 



(In fact the constant involved in the 0(1) term depends only on C.) 

Proof. Given a degree sequence and a set S of vertices satisfying the conditions of the lemma, 
we generate the perfect matching on P/v, which yields G' N , one edge at a time. 

Exposing the edge out of the first vertex of S, we see that the probability it is joined to a 
vertex of S is d(v<fi^~\))-i ' Given that this edge stays within S, we take a third vertex of S 
and note that the probability that it is joined to a vertex in S is d ^ y ^J 3 )))-3 ' M° re generally: 



P [There are no edges between S, V/v \ S in G' N ] = 
d{S) - 1 d(5) - 3 1 
2M- 1 2M - 3 "' 2M- d(5) + 1 
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d(5)! (2M-d(S))! M\2 M 

(d(5)/2)!2 d ( s )/ 2 (M-d(5)/2)!2 M - d ( s )/ 2 (2M)! 
/ M \ f 2M \ ( M \^ 
{d(S)/2j / \d(S))-{d(S)/2j ■ 

Assumptions (2) and (3) along with the main theorem in [20] transfer this bound to the space 
of a G^v random graph, and we are done. I 

For each n > 1, and any C, c, e > 0,B, we define E n = E n (C,c,e) to be the set of graphs 
on V n such that their core is non-empty, it has N vertices and M edges, where N > cn and 
N/M < 1 — e, maximum degree at most In N and moreover, the sum of the squares of the degrees 
in the core is no more than CN (see condition (2) in the previous proposition). To prove Lemma 
14.11 we will condition on the event E n for a specific choice of C,c and e: 

Remark. There exist C, c, e such that for every 1 < c\ < d < C2 the event E n occurs a.a.s. 
(see 0, [21]). 

Now we are ready to proceed with the proof of Lemma 14.11 
Proof of Lemma 14.11 In fact we shall prove that every S C V(C(Q n ^ p )) with G[S] being 
connected and AiV > \S\ > Cilnn has more than ao\S\ edges joining it with V(C{Q n)P )) \ S, 
for some ao, C\ which will be specified during our proof. For the time being we assume that 
C\ > 1, with the prospect of using Lemma 12.41 

Now for any integer s between C\ Inn and AiV, let X s = X s (d, ao) be the number of subsets S 
of V(C(Q nt p)) with |5| = s,d(S) <Ls,e(S) > s — 1 and having at most ao|iS'| edges joining it with 
V(C(Gn iP )) \ S. Considering such sets is sufficient, since by Lemma l2~il a.a.s. every connected 
set of vertices S with |5| > Cilnn has total degree no more than L\S\. We will condition on 
the event E n and, more specifically, we will show that K[X S \ E n ] = o(l/n) uniformly for every 
s between C\ Inn and AA^. To do so, we shall estimate the conditional expectation of X s given 
a degree sequence of the core of a graph in E n (in the conditional probability space of the event 
E n ), which we denote by E[X S ]. Proving that E[X S ] = o(l/n), i.e. that the random variable 
E[X S ] is bounded above by a function that is o(l/n), uniformly for any degree sequence of the 
core of a graph in E n will be sufficient. Then the lemma will follow since E n occurs a.a.s., if we 
choose C, c and e as in the above remark. 

We proceed with the estimation of E[JT S ], where C\ Inn < s < XN. For any a < ao, if the 
set S has total degree t, then we can choose the edges that will be adjacent to V{C(Q n . p )) \ S in 
at most ( * ) ways. Having specified these elements of the total degree, now the available total 
degree in S is equal to t' = t — as. Then the probability (i.e. the conditional expectation of the 
indicator random variable that is equal to 1 on the event) that every other edge is not adjacent 
to V(C(Qn tP )) \ S is by Proposition 14.51 f( t ^f 2 ) ) > uniformly over all sets S as above. This is 
the case because the premises of Proposition 14.51 are satisfied, by the definition of E n . 

Since s - 1 < t'/2 < M - N + s, the above bound is 0{l) maxjfj^)" 1 , (^J" 1 }- Note 
that by Stirling's formula we have 



( N s ) _ (5-1)! AH (M-a + 1)! _ 1 N N (M — {s — 1)) M ~( S ~ 1 ) 
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Since ( N g ) = ( N N _ s ) and ) = M^±l( ^J, applying © with N - s in place of s yields: 



W ) M ( N x JV " S 



M 



Provided ¥r < (1 — e), both these bounds are 0(Me for some /? = /3(e, A). 

We are now ready to bound E[X S ], for any s as above. Let D = D(s, ao) = {a : as G N, a < 
a } and T(s, a) = {t G N : 2(s - 1) + as < t < min{2(M - N + s), Ls}, t - as is even}. We 
now apply ([7|), and its corrollary, to obtain: 

em = "(')(^E,E ( (1 _l )/2 )"t) 



a£D(s,Q ) iGT(s,o) 



as 



= ow E E (5)' Me " s *' 

«eD(s,a ) t£T(s, a) V 7 

since t is bounded by Ls, for small enough a, < e 2 , and each term in the above sum is at 

most Me~2". Further the sum has 0(s 2 ) terms. So, for ao > sufficiently small and s > C\ Inn, 
where C\ = C\(d) is a sufficiently large constant the sum is o(-). I 



5 Long Induced Paths 

In this section we give the 

Proof of Lemma ll.2i We focus on the set S of those paths of G„ jP all of whose internal vertices 
have degree two but whose endpoints have degree greater than 2. Note that some of these paths 
are edges. We let S' be the mulitset consisting of the interiors of these paths. S' is a multiset 
because it may (indeed a.a.s does) contain many empty paths. 
The expected number of paths in S of length i is at most 

(n — 1)\ 

and at least 

9 (n — i)\ 

It follows that the expected number of paths of length m ^ s UJ ( n3 ^' 1 ) whilst the expected 
number of paths of length exceeding 10 jr n is o(l). The latter result tells us that a.a.s every 
path of S has length at most 10 H - , The first result and a simple second moment calculation 
shows that a.a.s there will be at least ^fn paths of length ^j-p in S. 

Now, we can construct an auxiliary multi-graph G' n p from G UtP by replacing each path in S 
by an edge with the same endpoints (so we delete the vertices on the interior of these paths). By 
our bounds on the size of the paths in S, we know that a.a.s G" has exactly one component of 
size ^(j^r) and this corresponds to H n #. 

We would like to say that given S and G' we can generate G n:P by taking a uniform bijection 
between the paths of S and the edges of G' n p whose endpoints have degrees bigger than 2. This 
may not be true, because G' n p may have multiple edges, and we cannot map two edges of G' n p 
to single edge paths in S. 
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For the moment, we assume that it is true and that G n>p has 0(nln n) edges, there are 
at least y/n paths of length in S, and the component of G' np corresponding to H n ^ p has 
^(npn) edges. Then, the probability that no path of S in H niP has length is at most 

(l-Q (i^)) n which is o(l). But we have seen that these three conditions a.a.s hold, so we 
are done if G' np is a simple graph. 

If G' n p is not simple, instead of considering S we consider the subset S* consisting of those 
paths of S whose endpoints are not joined by another path of S. We let G* n p be the graph 
obtained by replacing the paths of S* by edges. Now, given G* n p and S* , we obtain G„ iP by 
taking a uniform bijection between the paths of S* and the edges of G* whose endpoints both 
have degree at least three and are not joined by a path all of whose internal vertices have degree 
two. An easy first moment calculation shows that \S — S*\ is a.a.s o(|«S|) so we can apply the 
above argument to S* to prove our result. We omit the details. I 



6 Concluding Remarks 

We have investigated some geometric properties of the giant component of supercritical random 
graphs. In particular, we analysed the edge expansion of connected subsets of various sizes, 
concluding that it is the sets of order that have small expansion. As a consequense, these 
sets delay the mixing of a random walk. However, as the average degree grows, these shrink and 
the mixing time is determined by a global parameter which is the diameter. This was shown 
in [12], for average degree at least In 2 n. In a forthcoming paper [8], we give a more detailed 
analysis of this situation for degrees between Vln nlnln n and In 2 n. 
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